An Improved Accuracy of Multiclass Random Forest Classifier with Continuous Attribute Transformation Using Random Percentile Generation

نویسندگان

چکیده

This study aims to improve classification accuracy by transforming continuous attributes into categories randomly generating percentile values as categorization limits. Four algorithms were compared for the generation of and selected based on small variability distribution highest revenue expectations. The testing training data becomes second consideration. Random forest (RF) is modeled from percentiles with three transformation variations. results ANOVA test, algorithm variations transformation, has a mean that not significantly different best model original dataset model. However, in some data, RF attribute was superior effectiveness this very well applied LR, MLP, NB methods. In tuition fee dataset, application methods each had an 0.178, 0.204, 0.318. give significant increase 0.967, 0.949, 0.594 method, respectively. date fruits effective MLP method 0.193 (original attribute) 0.690 (continuous transformation). are effectively MPL, datasets categorical mixed attributes.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Improved Random Forest Classifier for Text Categorization

This paper proposes an improved random forest algorithm for classifying text data. This algorithm is particularly designed for analyzing very high dimensional data with multiple classes whose well-known representative data is text corpus. A novel feature weighting method and tree selection method are developed and synergistically served for making random forest framework well suited to categori...

متن کامل

Thresholding a Random Forest Classifier

The original Random Forest derives the final result with respect to the number of leaf nodes voted for the corresponding class. Each leaf node is treated equally and the class with the most number of votes wins. Certain leaf nodes in the topology have better classification accuracies and others often lead to a wrong decision. Also the performance of the forest for different classes differs due ...

متن کامل

Attribute bagging: improving accuracy of classifier ensembles by using random feature subsets

We present attribute bagging (AB), a technique for improving the accuracy and stability of classi#er ensembles induced using random subsets of features. AB is a wrapper method that can be used with any learning algorithm. It establishes an appropriate attribute subset size and then randomly selects subsets of features, creating projections of the training set on which the ensemble classi#ers ar...

متن کامل

Automated epileptic seizure detection using improved correlation-based feature selection with random forest classifier

Analysis of electroencephalogram (EEG) signal is crucial due to its non-stationary characteristics, which could lead the way to proper detection method for the treatment of patients with neurological abnormalities, especially for epilepsy. The performance of EEG-based epileptic seizure detection relies largely on the quality of selected features from an EEG data that characterize seizure activi...

متن کامل

Segmentation of retinal OCT images using a random forest classifier

Optical coherence tomography (OCT) has become one of the most common tools for diagnosis of retinal abnormalities. Both retinal morphology and layer thickness can provide important information to aid in the differential diagnosis of these abnormalities. Automatic segmentation methods are essential to providing these thickness measurements since the manual delineation of each layer is cumbersome...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: International Journal on Advanced Science, Engineering and Information Technology

سال: 2023

ISSN: ['2088-5334', '2460-6952']

DOI: https://doi.org/10.18517/ijaseit.13.3.18379